Identifying Cases of Type 2 Diabetes in Heterogeneous Data Sources: Strategy from the EMIF Project
نویسندگان
چکیده
Due to the heterogeneity of existing European sources of observational healthcare data, data source-tailored choices are needed to execute multi-data source, multi-national epidemiological studies. This makes transparent documentation paramount. In this proof-of-concept study, a novel standard data derivation procedure was tested in a set of heterogeneous data sources. Identification of subjects with type 2 diabetes (T2DM) was the test case. We included three primary care data sources (PCDs), three record linkage of administrative and/or registry data sources (RLDs), one hospital and one biobank. Overall, data from 12 million subjects from six European countries were extracted. Based on a shared event definition, sixteeen standard algorithms (components) useful to identify T2DM cases were generated through a top-down/bottom-up iterative approach. Each component was based on one single data domain among diagnoses, drugs, diagnostic test utilization and laboratory results. Diagnoses-based components were subclassified considering the healthcare setting (primary, secondary, inpatient care). The Unified Medical Language System was used for semantic harmonization within data domains. Individual components were extracted and proportion of population identified was compared across data sources. Drug-based components performed similarly in RLDs and PCDs, unlike diagnoses-based components. Using components as building blocks, logical combinations with AND, OR, AND NOT were tested and local experts recommended their preferred data source-tailored combination. The population identified per data sources by resulting algorithms varied from 3.5% to 15.7%, however, age-specific results were fairly comparable. The impact of individual components was assessed: diagnoses-based components identified the majority of cases in PCDs (93-100%), while drug-based components were the main contributors in RLDs (81-100%). The proposed data derivation procedure allowed the generation of data source-tailored case-finding algorithms in a standardized fashion, facilitated transparent documentation of the process and benchmarking of data sources, and provided bases for interpretation of possible inter-data source inconsistency of findings in future studies.
منابع مشابه
Comparing Three Data Mining Algorithms for Identifying the Associated Risk Factors of Type 2 Diabetes
Background: Increasing the prevalence of type 2 diabetes has given rise to a global health burden and a concern among health service providers and health administrators. The current study aimed at developing and comparing some statistical models to identify the risk factors associated with type 2 diabetes. In this light, artificial neural network (ANN), support vector machines (SVMs), and multi...
متن کاملIdentifying and Analyzing Coordination Barriers in the Context of Urban Infrastructure Provision in Iran A Qualitative Multiple Case Study
Introduction: Urban infrastructure systems provide foundations for modern civil communities and enhance the quality of life. Coordination between different urban infrastructure agencies involved in urban infrastructure provision plays a significant role in the success of these critical urban sub-systems. It brings together various independent agencies to make their endeavors more accordant. In ...
متن کاملEnzyme Assay Guided Isolation of an α-Amylase Inhibitor Flavonoid from Vaccinium arctostaphylos Leaves
The management of postprandial hyperglycemia is an important strategy in the control of diabetes mellitus and complications associated with the disease, especially in the diabetes type 2. Therefore, inhibitors of carbohydrate hydrolyzing enzymes can be useful in the treatment of diabetes and medicinal plants can offer an attractive strategy for the purpose. Vaccinium arctostaphylos leaves are c...
متن کاملIdentifying the pattern of the talent management as the winning strategy of the organization; A study in the National Iranian South Oil Company
This study was conducted to identify the dimensions, components and indices of the talent management in the National Iranian South Oil Company. The study has been considered an applied research in terms of its purpose and in terms of data was qualitative and it has been done based on grounded theory in terms of the nature of the implementation. The statistical population of the study was expe...
متن کاملDesigning a glycemic control strategy to maintain glucose homeostasis and prevent hypoglycemia for subjects with type 1 diabetes
This paper presents using the fractional PImDn controller module which manipulates insulin infusion rate to maintain normoglycemia in subjects with type 1 diabetes. To prevent severe hypoglycemia, a conventional proportional controller is used to regulate glucagon infusion rate when the blood glucose levels fall below a threshold. Two sets of controller parameters are obtained and evaluated. Fo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 11 شماره
صفحات -
تاریخ انتشار 2016